During the last decade, Speech Emotion Recognition (SER) has emerged as an integral component within Human-computer Interaction (HCI) and other high-end speech processing systems. Generally, an SER system targets the speaker's existence of varied emotions by extracting and classifying the prominent features from a preprocessed speech signal. However, the way humans and machines recognize and correlate emotional aspects of speech signals are quite contrasting quantitatively and qualitatively, which present enormous difficulties in blending knowledge from interdisciplinary fields, particularly speech emotion recognition, applied psychology, and human-computer interface. The paper carefully identifies and synthesizes recent relevant literature related to the SER systems' varied design components/methodologies, thereby providing readers with a state-of-the-art understanding of the hot research topic. Furthermore, while scrutinizing the current state of understanding on SER systems, the research gap's prominence has been sketched out for consideration and analysis by other related researchers, institutions, and regulatory bodies.

A Comprehensive Review of Speech Emotion Recognition Systems / Taiba Majid, Taiba Majid; Teddy Surya, Gunawan; Syed Asif Ahmad, Qadri; Mira, Kartiwi; Eliathamby, Ambikairajah. - In: IEEE ACCESS. - ISSN 2169-3536. - 9:(2021), pp. 47795-47814. [10.1109/access.2021.3068045]

A Comprehensive Review of Speech Emotion Recognition Systems

Wani, Taiba Majid
Primo
Writing – Original Draft Preparation
;
2021

Abstract

During the last decade, Speech Emotion Recognition (SER) has emerged as an integral component within Human-computer Interaction (HCI) and other high-end speech processing systems. Generally, an SER system targets the speaker's existence of varied emotions by extracting and classifying the prominent features from a preprocessed speech signal. However, the way humans and machines recognize and correlate emotional aspects of speech signals are quite contrasting quantitatively and qualitatively, which present enormous difficulties in blending knowledge from interdisciplinary fields, particularly speech emotion recognition, applied psychology, and human-computer interface. The paper carefully identifies and synthesizes recent relevant literature related to the SER systems' varied design components/methodologies, thereby providing readers with a state-of-the-art understanding of the hot research topic. Furthermore, while scrutinizing the current state of understanding on SER systems, the research gap's prominence has been sketched out for consideration and analysis by other related researchers, institutions, and regulatory bodies.
2021
Speech recognition; Databases; Feature extraction; Emotion recognition; Speech processing; Task analysis; Neural networks; Speech emotion recognition; database; preprocessing; feature extraction; classifier
01 Pubblicazione su rivista::01a Articolo in rivista
A Comprehensive Review of Speech Emotion Recognition Systems / Taiba Majid, Taiba Majid; Teddy Surya, Gunawan; Syed Asif Ahmad, Qadri; Mira, Kartiwi; Eliathamby, Ambikairajah. - In: IEEE ACCESS. - ISSN 2169-3536. - 9:(2021), pp. 47795-47814. [10.1109/access.2021.3068045]
File allegati a questo prodotto
File Dimensione Formato  
Wani_A-Comprehensive_2021.pdf

accesso aperto

Note: DOI 10.1109/ACCESS.2021.3068045
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 5.5 MB
Formato Adobe PDF
5.5 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1713909
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 277
  • ???jsp.display-item.citation.isi??? 143
social impact